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SUMMARY 


This  paper  determines  upper  bounds  on  the  expected   time  complexity  for  a 
variety  of  parallel  algorithms  for  undirected  and  directed  random  graph  problems. 
For  connectivity,  transitive  closure,  minimum  spanning  trees  and  all  pairs 
minimum  cost  paths  we  prove  the  expected  time  to  be   Odoglog  n)   for  the  W-RAM 
(this  parallel  RAM  machine  allows  resolution  of  write  conflicts)  and 
O(log  n  .  loglog  n)   for  the  P-RAM  (which  allows  simultaneous  reads  but  not 
simultaneous  writes).   We  also  show  that  the  problem  of  graph  isomorphism  has 
expected  parallel  time   Odoglog  n)   for  the  W-RAM  and  0  (log  n)   for  the  P-RAM. 
Our  results  also  improve  known  upper  bounds  on  the  expected  space  required 
for  sequential  graph  algorithms.   For  example,  we  show  that  the  problems  of 
finding  connected  components  transitive  closure,  minimum  spanning  trees  and 
min  cost  paths  have  expected  sequential  space   O(log  n  •  loglog  n)   on  a  deter- 
ministic Turing  Machine,   We  use  a  simulation  of  the  W-RAM  to  get  these  expected 
sequential  space  bounds . 


WORST  CASE  INPUT 


PROBLEM 

P-RAM 

W-RAM 

Connectivity 

T=e (lognlogd)             t=e(logn) 

3                                        3 
TT    -  n  /logn                 it   =  n   logn 

[Ja'    Ja',    78]              [Reif,82A] 

T=e(log^n) 

'    -  n  /log  n 
[Chin   et   al.Bl] 

T=9(logn) 

TT=n+n 
[Shiloah, 
Vishkin,81] 

K-Connectivity 
K  Blocks 

t=e(logn) 

lReif,82A] 

TT   =  n    logn 

Q 

M 

Q 
2 

Biconnected 
Components 
(k    is    the   number   of 
components    and      d      is 
the   depth   of    the   graph 

T=e (lognlogdlogk)       T=e(logn) 

"n   =  n   /logn                   'n'=     n   logn 
[Ja' Ja" ,78]                      [Reif ,82] 

Min   Cost    Spanning 
Trees 

T=e{log^n)                       T=e(logn) 

.   =  n^                              7T=n«(l> 

[Savage,    Ja'Ja',81]      [Reif,S2A] 

All   Pairs    Shortest 
Paths 

T=e(log^n) 

3 

•n    =   n 

[Dekel   et   al. ,81] 

Q 

M 

Transitive   Closure 

and 
Strong  Components 

T=e(lognlogd) 

T^  =   n  /logn 
[Ja'Ja' ,78] 

Min    Cost    Spanning 

Trees    and 
K- Connectivity 

T=e(logn) 

TT=n 
[Reif  ,82A] 

Isomorphism 

Table  Tl 
Previously  Known  Results  for  Worst  Case  Input 

TT  is  the  number  of  processors  on  worst  case. 

T  is  parallel  time  for  worst  case  input. 

T  is  expected  parallel  time  for  worst  case  input. 

n   is  the  number  of  nodes  and  m   is  the  number  of  edges 


RANDOM   INPUT 


PROBLEM 

P-RAM 

W-RAM 

UNDIRECTED 

Connectivity 

T=0(logn  -loglogn) 

2 
fT  =  n   /logn*  loglogn 

T=0( loglogn) 

_                  2 
TT  =   n+pn 

Min   Cost   Spanning 
Tree 

T=0 (logn* loglogn) 

IT  =  pn^ 

T=0 (loglogn) 

3 

TT  =   pn 

All   Pairs    Shortest 
Paths 

T=0 ( logn • log logn ) 

IT  =   n   /logn 

T=0( loglogn) 

3 
TT  =   n 

Biconnected 

Components 

_                                          2 
T=0 (logn' (loglogn)    ) 

Tf  =   n   /logn 

T=0( (loglogn)    ) 

3 

TT   =   n 

Q 
Q 

Transitive    Closure 

and 
Strong  Components 

T=o (logn* loglogn) 

7T   =   n   /loan 

T=0( loglogn) 

3 

TT   =   n 

Minimum  Cost   Paths 
(all  pairs) 

T=0 ( logn • loglogn ) 
TT    =   n    ,^logn 

T=0( loglogn) 

3 

TT   =   n 

Minimum  Spanning 
DAG 

T=0( loglogn) 

Isomorphism 

T=0(logn) 
TT    =   n   /logn 

T=o (loglogn) 

2 

TT    =   n 

Table  T2 
Our  results    for     Random   Input 

T  =  expected  parallel  time    for  Random  Input 

TT  =  expected  number  of  processors    for   Random  Input 


RANDOM  INPUT 


Problem 

Expected  Deterministic   Sequential   Space 

UNDIRECTED 

Connectivity 

0 ( logn • loglogn) 

Min    Cost    Spanning 
Tree 

O(logn'loglogn) 

All    Pairs    Shortest 
Paths 

0 ( logn • loglogn ) 

Biconnected 
Components 

0 (logn* (loglogn)    ) 

Q 

H 

U 

M 

Q 

Strong  Components, 
Transitive    Closure 

0 (logn* loglogn) 

All    Pairs    Shortest 
Paths 

0 (logn 'loglogn) 

Minimum  Spanning   DAG 

0 (logn -loglogn) 

- 

Isomorphism 

O(logn* loglogn) 

Table  T3 
Our  results  for  Expected  Sequential  Space  on  Random  Input 
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1.    INTRODUCTION 

1  . 1   The  Parallel  Machine  Models 

We  consider  here  some  fundamental  models  of  parallel  computation,  all  of 
which  assume  the  presence  of  an  unlimited  number  of  processors.   In  the  first 
model,  the  P-RAM  of  [Wyllie,  79],  [Fortune,  Wyllie,  78],  each  processor  is 
capable  of  performing  arithmetic,  boolean  and  certain  read  and  write  operations. 
In  the  P-RAM,  different  processors  can  read  the  same  memory  location  at  the 
same  time.   They  may  store  information  at  different  memory  locations  simul- 
taneously, but  no  two  processors  can  attempt  to  change  the  contents  of  the 
same  memory  cell  on  the  same  step.   [Reif,  82A,B]  proposed  a  probabilistic 
P-RAM  where  processors  are  capable  of  doing  independent  probabilistic  choices 
on  a  fixed  input  (but,  again,  simultaneous  writing  at  the  same  location  is 
not  allowed) . 

In  another  proposed  machine  model,  the  W-RAM,  simultaneous  access  to  the 
same  memory  location  is  allowed  for  both  read  and  write  operations.   In  the 
case  of  a  simultaneous  write  attempt,  exactly  one  processor  succeeds  but  we 
make  no  assumption  of  which  one  succeeds.   (See  [Shiloah,  Vishkin,  80]  and 
[Goldschlager,  78]  for  similar  models.) 

A  time  bound  for  a  problem  in  the  W-RAM  can  be  translated  into  a  space 
bound  for  the  same  problem  in  a  deterministic  Turing  Maching  by  the  following 
Lemma. 

LEMMA  1.1.  Let     L     be  a   language  accepted  by  a     T(n)  time-bounded 
W-RAM,  with  a  polynomial  number  of  processors .     Then     L  is  accepted  by  a 
0 {T(n) -maxiT (n) ,log  n})  space  bounded  deterministic  Turing  Machine. 


For  a  proof,  see  [Goldschlager ,  78],  [Cook,  80]  and  [Ruzzo,  82].   The 
result  has  been  proved  for  the  SIMDAG  Machine  which  is  more  powerful  than 
the  W-RAM.   It  is  also  clear  that 

PROPOSITION:  Any  parallel  algorithm  for  the  W-RAM  of  time  T  implies 
a  parallel  algorithm  on  the  p-RAM  model,  of  time  at  most  0(T-log  m)  where 
m  is  the  maximum:  .:onber  of  processors  which  could  possibly  compete  for  the 
same  memory    location. 

Proof.   One  can  simulate  the  nondeterministic  choice  of  write  conflict 
resolutions  by  a  tree  of  depth   log  m  of  pairwise  selections  to  find  a 
unique  winner.  n 

I . 2   Known  Parallel  Graph  and  Digraph  Algorithms  and  Open  Problems 

We  list  here  previous  results  and  also  provide  a  table  (Table  1) .   The 

best  known  deterministic  algorithm  for  undirected  graph  connectivity  is  of 

2  2    2 

5  (log  n)   time  in  P-RAM  with   n  /log  n  processors,  [Chin,  Lam,  Chen,  81]. 

[Hirschberg,  Chandra,  Sarwate ,  79]  presented  a  version  of  the  above  algorithm 

2 
which  used  G (n  /log  n)   processors, 

Def  in  i  t  i  on.   Let  G=  (V,E)   be  a  (directed  or  undirected)  graph  and  let 

d(v,w)   for   v,w  V  be  the  length  of  the  shortest  path  from  v   to   w  if 

any,   -°^  otherwise.   Then  the  depth,      d(G),  of  the  graph  G  is  defined  to  be 

the  max   ^  {d(v,w),l}.   Note  that  for  undirected  graphs   G  with  nontrivial 
v,wtv 

connected  components,   d{G)   is  the  maximum  of  the  diameters  of  the  connected 
components  of  G.   [Savage,  Ja'Ja',  81]  remarked  that  the  [Hirschberg,  et  al., 
79]  algorithm  is  actually  of  time  6 (log  n-min{log  n,d;)   where  d  is  the  depth 
of  the  graph. 
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The  best  known  deterministic  algorithm  for  undirected  connectivity  in  the 
W-RAM  is  due  to  [Shiloah,  Vishkin,  81]  and  is  of   6 (log  n)  time  with   6{n+m) 
processors,  where  m  is  the  number  of  the  edges  of  the  graph.  [Shiloah,  Vishkin  81] 
had  the  ingenious  idea  to  use  distributed  collapsing  of  the  height  of  the  data 
structures  which  are  candidates  for  connected  components.   Their  algorithm  is 
essentially  different  from  that  of  [Hirschberg  et  al,  79]  in  the  sense  that  the 
algorithm  of  [Hirschberg  et  al,  79]  cannot  be  adjusted  to  run  in  time  9  (log  n)  in 
the  stronger  model  W-RAM.   Also,   Ja'Ja',  78   has  an  6(log  n  log  d)  time  and 

n  /log  n  processors  algorithm  for  the  P-RAM,  where  d  is  the  depth  of  the  graph.  For 

2 
the  biconnected  components,  [Ja'Ja',  Savage,  81]  give  an  6(log  n)  time  algorithm 

in  deterministic  P-RAM  with  n  /log  n  processors  and  also  an  6  (log  n'log  d*log  k) 

2 
time  algorithm  with  mn+n  log  n  processors  where  k  is  the  number  of  components  and  m 

is  the  number  of  edges  of  the  graph.   For  the  minimum  spanning  tree  problem  [Ja'Ja', 

2  2 

Savage,  81]  give  an  6 (log  n)  time  algorithm  with  n  processors.   For  all  the  above 

undirected  graph  problems,  [Reif,  82A]  gives  an  6 (log  n)  time  probabilistic 

6(1) 
algorithm  for  the  P-RAM,   requiring  n     processors.   Note  the  results  of  [Reif,  82A] 

are  not  applicable  to  digraph  problems. 

For  transitive  closure  and  strong  components  in  digraphs  the  best  result 
[Ja'Ja',  78]  for  the  P-RAM  is  of   9  (log  n-log  d)   time  with  nVlog  n 
processors.   For  all  pairs  shortest  paths  in  digraphs,  [Dekel,  Massimi ,  Sahni , 
81]  give  an  9 (log  n)   algorithm  with   n   processors.   No  parallel  algorithm 
for  isomorphism  was  known  up  to  now. 

The  above  discussion  of  previous  work  indicates  several  basic  unanswered 
questions : 

(a)  Can  parallel  time  be  dropped  below  ^  (log  n)   for  directed 
connectivity  problems  on  the  P-RAM? 

(b)  Can  parallel  time  be  dropped  below  T/dog  n)   for  connectivity,  bi- 
connectivity,  minimum  spanning  tree,  etc.  on  the  W-RAM  model? 
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(c)   Can  parallel  time  for  undirected  connectivity  be  dropped  below 
nciog  n)   and  simultaneously   the  number  of  processors  be  dropped  below 
nCn^/log^n)   on  the  P-RAM? 

This  paper  gives  affirmative  answers  to  the  above  questions  if  the  words 
"parallel  time"  and  "number  of  processors"  are  preceded  by  the  word  "expected". 

1 . 3   Known  Sequential  Space  Bounds  for  Graph  Problems 

A  space  efficient  sequential  vrobabilisvic  search   technique  was  first 
introduced  by  [Aleliunas,  Karp ,  Lipton,  Lovasz  and  Rackof f ,  79]  to  test 
connectivity.    [Lewis  and  Papadimitriou,  80]  observe  that  this  technique 
immediately  implied  a  space   0(S(n))   probabilistic  algorithm  to  simulate 
a  symmetric  nondeterministic  machine  with  space   S(n).   [Reif,  82A]  extended 
the  above  techniques  to  yield   O(log  n)   probabilistic  sequential  space 
algorithms  for  many  undirectly  graph  problems.   However,  the  above  do  not 

extend  to  directed  graph  problems,  for  which  the  problem  of  achieving  less 

2  2 

than     .".(log  n)      sequential   space   is   still   open.       (The     f2  (log   n)      space  bound 

is  implied  by  the  theorem  of  [Savitch,  70].) 

1 . k      Average  Performance  of  Graph  Algorithms  and  Random  Graphs 

Considerable  work  has  been  done  on  sequential  graph  algorithms  which  are 
fast  on  the  average  given  random  input  graphs.   This  includes  the  work  of 
[Angluin,  Valiant,  79],  [Karp,  76],  [Karp,  Sipser,  81],  [Schnorr,  78],  [Karp, 
Tarjan,  80],  [Reif,  Spirakis ,  81],  [Spirakis,  81]. 

Almost  no  previous  work  examined  the  average  performance  of  parallel 
graph  and  digraph  algorithms,  with  the  exception  of  [Shamir,  Upfal,  82]  on 
coloring  random  graphs  and  [Karp,  Sipser,  81]  on  parallel  matchings  in 
random  graphs.   Furthermore,  no  previous  work  analyzes  the  expected  sequential 
space  of  graph  algorithms. 


We  analyze  here  the  average  performance  of  parallel  algorithms  for  con- 
nectivity, biconnected  components,  strong  connectivity  and  transitive  closure, 
minimum  cost  spanning  tree,  minimum  cost  all  pair  shortest  paths  and  graph 
isomorphism.   Our  analysis  also  gives  upper  bounds  on  the  expected  sequential 
space  complexity  of  the  above  problems. 

The  input  to  our  algorithms  have  is  assumed  to  be  a  random  graph  of  the 

model   G     as  defined  in  [Erdos,  Renyi ,  60],  or  a  random  digraph   D     as 
n,p  n,p 

defined  in  [Anguin  and  Valiant,  79]. 

In  the  Appendix  we  investigate  the  probability  distribution  of  the  depth 

of  a  random  qraph  G     or  a  random  digraph   D     (the  depth  was  defined  in 

^      ^  n,p  n,p 

§1.2   of   this  paper).      The  mean  value     d      (and  bounds   on  the   probability   distri- 
bution)   of  the   depth   is    crucial   to  a  number  of  our   results.      We  prove  the    following 
theorems : 

THEOREM    1.1.      There  is  a  constant     c    >2       such  that  for  any  probability     p 

[c        2c  /c   \    21 

—  , -( — j  the  graph     G  has  average  depth 

d=0(log  n)  .      Furthermore^      prob{d=0{log  n) }      tends  to  1  as     n     to     °°. 

THEOREM   1.2.      There  is  a  constant     c    >2     such  that  for  any  probability     p     in 
the  range      [0,1]  -      1  - -J  1 — -  ,   —     ,      the  digraph     D  has  average  depth 

d  =  0(log  n)  .      Furthermore y      prob{d=0(log  n)  }      tends  to   1  as     n     tends   to     °°. 

We   also  prove   that   for     p^— 5 (c      a  particular   constant     >1)      the 

i^n" 

depth  d  of  G     (or  D   )  is  less  than  or  equal  to  3  with  probability  ^1 

n,p        n,p 

as  nH<c  .  Although  the  results  d  =  0(log  n)  can  be  deduced  from  results  of 
[Erdos,  R^nyi,  59]  for  several  values  of  p  (see  Appendix) ,  the  results  for 
sparse  graphs  with  edge  probability  p  =  6  (— )   and  the  result  for  very  dense 

graphs   (p^— ^ )   are  new  contributions. 

v'n 
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] .5   Our  Expected  Time  Results  for  Parallel  Graph  Problems 

Using  the  above  facts  and  some  nontrivial  analysis,  we  show  that  one  can 
find  the  connected  components  of  a  random  graph   G     and  the  strong  com- 
ponents and  transitive  closure  of  a  random  digraph  D     and  also  the  minimum 
spanning  tree  of  a  weighted  random  graph  and  the  all-pairs  shortest  paths  of  a 
weighted  random  graph  or  digraph  in  average  parallel  time   Odoglog  n)   is  the 

W-RAM  and  Oilog  n  •  loglog  n)   is  the  P-RAM.   The  average  time  for  biconnected 

2  2 

components  is   0( (loglog  n)  )   for  the  W-RAM  and   O(log  n  •  (loglog  n)  )   for 

the  P-RAM.   The  average  time  for  graph  isomorphism  is   Odoglog  n)   in  the 

W-RAM  and  0{log  n)   in  the  P-RAM.   (See  also  Table  T.2).   The  results  for 

digraph  connectivity  and  digraph  and  graph  all-pairs  shortest  paths  are  implied 

by  the  result  that  the  average  deoth  of   G     or   D     is   d=0(log  n) 

n,p       n,p 

(see  Section  1.4). 

However,  our  result  for  undirected  connectivity  in  the  W-RAM  (and  the 
result  for  minimum  spanning  tree  in  the  W-RAM)  are  not  implied   by  the  fact  that 
d=0(log  n)  .   A  large  part  of  our  paper  is  devoted  to  a  new  algorithm  (and 
probabilistic  analysis)  for  undirected  connectivity  in  the  W-RAM.   A  side- 
benefit  of  this  new  algorithm  is  a  result  about  the  expected  number  of  processors 

2 
needed  for  undirected  connectivity  in  the  P-RAM  model,  which  is   n  /(log  n'loglog  n) 

for  time  0  (log  n« loglog  n) . 

The  above  results  answer  the  (open)  questions  (a),  (b) ,  (c)  of  Section  1.2 

of  this  paper  in  an  affirmative  way,  given  that  one  concentrates  in  average 

case  analysis. 

2 
We  also  show  that  the  expected  number  of  processors  is   n  +  pn   for 

undirected  connectivity  in  the  W-RAM  (and  this  is  linear  for  pn  =  9(1)).   For 

3 

directed  connectivity  and  transitive  closure,   n  /log  n  processors  are  needed 

.for  the  P-RAM  model  and  n   processors  are  expected  for  the  W-RAM  model. 


For  bi connected  components  we  need  n  /log  n  and  n   for  the  P-RAM  and  W-RAM, 

2 
respectively.   For  graph  isomorphism  we  need  n   processors  for  the  W-RAM  and 

2 

n  /log  n  for  the  P-RAM.   For  the  minimum  spanning  tree  (all  pairs  shortest 

2         3  2 

paths)  we  need  n  /log  n  (n  /log  n)   processors  for  the  P-RAM  and  n  p 

processors  (n  p  processors)  for  the  W-RAM  on  the  average. 

We  remar)<  there  is  generally  a  considerable  drop  in  time  complexity  when 

one  concentrates  on  the  average  case  of  the  above  problems  rather  than  worst 

case  input.   For  sake  of  comparison  we  list  previous  results  at  Table  Tl  and 

our  time  complexity  results  at  Table  T2.   Note  that  none  of  the  previously 

known  algorithms  for  undirected  connectivity  implies  an  expected  time  below 

2 
Cl  (log  n)   in  the  P-RAM  and  a  simultaneous  drop  of  expected  number  of  processors 

2 
below  n  /log  n  even  if  the  average  depth  result  d=0(log  n)   is  used.   Note 

also  that  our  results  for  W-RAMs  can  be  applied  to  SIMDAGs  and  other  parallel 

computation  models  allowing  resolution  of  write  conflicts  (see  [Kucera,  81]). 

1 . 6   Our  Results  for  Expected  Space  Complexity 

Our  results  improve  known  upper  bounds  on  the  expected  space  required  for 
sequential  graph  algorithms.   By  using  our  expected  parallel  time  results  for 
W-RAMs  and  Lemma  1  we  provide  an  upper  bound  of  O(log  n-loglog  n)   for  the 
expected  deterministic  sequential  space  needed  for  undirected  connectivity, 
directed  (strong)  connectivity  and  transitive  closure,  min  cost  spanning  tree 
and  directed  and  undirected  all  pairs  shortest  paths,  and  graph  isomorphism. 
[Reif ,  81A]  provides  0 (log  n)  probabilistic  sequential  space  bounds  for  the 
undirected  versions  of  all  the  above  problems,  but  his  method  does  not  seem  to 
extend  to  directed  graph  problems.   Our  results  answer  the  question  posed  in 
Section  1.3  in  an  affirmative  way  when  one  concentrates  on  expected  space  and 
our  methods  could  be  viewed  as  indirect  ways  of  getting  results  about  expected 
sequential  space. 
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2.    EXPECTED  PARALLEL  TIME  FOR  DIGRAPH  CONNECTIVITY 

The  following  well-known  algorithm  for  computing  the  transitive  closure  of 
an  nx  n  boolean  matrix  A   can  be  used  for  digraph  connectivity  and  transitive 
closure: 

input  A 
begin 

B  ^  A  +  I 
L:     B"  ^  B 

B  *-    B-B 

if   B'  7^  B   then  goto  L  else  output  B 
end 

2 

This  algorithm  was  shown  in  [Dekel,  Nassimi,  Sahni,  81]  to  be  of   6; log  n) 

worst  case  time  by  using   n   processors  in  the  P-RAM  model.   [Ja' Ja' ,  78]  did 
an  efficient  implementation  of  this  algorithm  showing  that  it  could  be  done  in 
9(log  n-log  d)   time  on  the  P-RAM  model  and  by  using   n  /log  n  processors, 
where  d  is  the  depth  of  the  digraph   D.   Note  that  we  can  drop  the  number  of 
processors  to  n  "2  /log  n  by  applying  Strassen's  algorithm  for  matrix 
multiplication.   It  is  easy  to  observe,  that  the  primitive  set  of  operations 
of  [Ja'Ja',  78]  (i.e.,  the  union  of   0 (n)   sets  of   0{n)   integers  each,  from  1 
to  n)  can  be  done  in   0(1)   time  in  the  W-RAM,  implying  an   O(log  d)   algorithm 
for  strong  components  and  transitive  closure  (see  also  [Kucera,  82]).   Using  our 
results  on  the  average  depth  of  D     given  in  the  Appendix  immediately  imply; 

THEOREM  2.1.  The  average  parallel  time  for  transitive  closure  of  directed  graphs 
is     Odoglog  n)  for  the  Vi- BAM  model  and     Odog  n.loglog  n)  for  the   P-RAM 
model.      The  probability  that  the  parallel  time  is  more  than  this  is  less   than 
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-V  3  3 

n  ',  Y>2.  The   w-ram  model  uses     n  proaessors  and  the   p-ram  model  uses     n  /log  n 

processors  for  the  stated  time  bounds. 

[Ja'Ja' ,  78]  gave  also  an  algorithm  for  finding  the  strong  components  of 

a  digraph  in  parallel  time   6(log  n  log  d)   in  the  R-RAM  model,  by  using 

n  /log  n  processors.   Our  results  on  the  depth  of   D     also  imply: 

n,p 

THEOREM  2.2.     The  average  parallel   time  for  finding  the  strong  components  of  a 

digraph  in  the  p-ram  model  is     O(log  n»loglog  n) ,    using     n  /log  n     processors. 

3 
In  the  W-RAM  model  the  average  time  is     Odoglog  n)      using     n        processors. 

-Y 
The  probability  that  the  parallel   time  is  more   than  this,    is   less   than    n 

for  some    y  >  2. 

By  using  Lemma  1.1  we  also  get 

THEOREM  2.3-  The  average  deterministic  sequential  space  needed  for  transitive 
closure  and  strong  components  of  digraphs  and  connected  components  of  undirected 
graphs  is     O(log  n.loglog  n) .  The  probability   that  the  space  needed  is 
0  (log  n-loglog  n)  is  at   least     1-n    for  some    y  >  2. 

(Note  that  the  same  techniques  can  be  applied  to  undirected  graphs  to  compute  the 
connected  components  by  the  P-RAM  in   O(log  n-loglog  n)   average  time.   However, 
the  number  of  processors  used  is  wasteful.   In  the  next  section  we  give  a 
derivation  of  this  result  which  is  independent  of  our  results  about  depth  of 
G     and  optimizes  simultaneously  over  expected  time  and  expected  number  of 
processors.) 
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3.    EXPECTED  PARALLEL  TIME  AND  PROCESSOR  BOUNDS  FOR  CONNECTIVITY  IN  UNDIRECTED 
GRAPHS 

3.0  A  New  Probabilistic  Connectivity  Algorithm 

We  present  here  a  new  algorithm  for  finding  connected  components  of  a  graph 
on  a  W-RAM.   Although  it  is  related  to  the  algorithm  of  [Shiloah,  Vishkin,  82], 
it  uses  probabilistic  choice  so  the  processors  of  the  W-RAM  model  are  allowed 
to  independently  choose  ramdom  bits.   Also,  we  assume  random  resolution  of  write 
conflicts.   If  k  processors  attempt  to  write  simultaneously  at  the  same 
memory  location,  then  exactly  one  succeeds  and  the  probability  that  each 
particular  one  succeeds  is   1/k.   Our  algorithm  for  graph  connectivity  has 
O(loglog  n)   expected  parallel  time  in  the  W-RAM  which  leads  to  an 
O(log  n-loglog  n)   expected  parallel  time  algorithm  in  the  P-RAM.   Our  algorithm 

uses   2n + m  processors  for  a  graph  of  n  vertices  and  m  edges,  hence  it  uses 

2 
2n + pn   processors  on  the  average.   An  improvement  over  this  is 

2 
n  /(log  n-loglog  n)   processors  will  be  shown  in  section  3.2. 

During  the  computation  each  vertex  v  has  a  pointer  field  D(v)   through 

which  it  points  on  another  vertex  or  to  itself.   One  can  regard  v-«-D{v)   as  a 

(directed)  edge  in  an  auxiliary  graph,  called  the  pointers -graph .      In  the 
probabilistic  part  of  our  algorithm,  this  pointers-graph  will  always  be  a 
collection  of  sets  called  super-vertices  each  of  which  is  a  rooted  star  of 
height  one,  with  a  self- loop  at  the  roots.   As  the  algorithm  proceeds,  the  number 
of  sets  decreases  while  each  individual  set  increases  or  disappears.   This  is 
caused  by  a  "hooking"  operation  in  which  a  supervertex  can  be  hooked  on  anof.  r 
supervertex.   In  each  phase  of  the  algorithm  each  of  the  supervertices  decide 

(probabilistically,  depending  on  their  size)  to  be  either  aoaeptors   or  proposers. 
Only  proposers  can  hook  thenselves  to  other  supervertices  which  must  be  acceptors. 
Partial  collapsing  is  done  in  each  phase  so  that  supervertices  remain  being 
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rooted  stars  of  height  one.   At  the  end,  each  remaining  set  is  a  connected 
component  of  the  graph  and  is  still  a  rooted  star. 


Let   s (t)   be  the  following  sequence: 

s{l)  =  21og^n 
and  for   i>l,   s(i  +  l)  = 


2^(l  +  £)log'*n 


(£   is  a  small  constant,  to  be  determined  later.)   We  now  describe  the  algorithm 
in  detail. 

Algorithm  MERGE 

Incut:   Graph  G=  (V,E) ,  an  instance  of  the  random  graph  G    .   We  shall 

— ' n,p 

use   D  (v)  =  u  to  denote  that  an  edge  from  vertex  v  £  V  points  to  vertex   u 

th 
after  the   t    pnase. 

Initially,  at  phase  1,  for  each   '^V,   {v}   is  a  supervertex,  these  are  the 
only  supervertices ,  each  supei'-'ertex  is  declared  tavge     and  D  (v)  =v  for  each 
vertex  v. 

In  the  following  text  we  will  use  "vertex"  and  the  assigned  processor  to  that 
vertex  interchangingly .   We  allocate  a  vertex  processor  to  each  vertex.   Each 
undirected  edge  iu,v}  is  viewed  as  two  ordered  pairs  <u,v>,  <v,u>£vxv  and 
we  allocate  one  edge  processor  to  each  such  pair. 

The  algorithm  has  3  stages.   Before  the  start  of  the  first  stage,  each  edge 
processor  draws  a  random  number  between  0  and  1,  uniformly.   We  require  that 
<u,v>  and  <v,u>  draw  the  same  number,  but  otherwise  the  choices  are  independent. 

Those  processors,  whose  number  is  less  than  — ,  participate  in  the  first  stage 

1     2 
only.   Those,  whose  number  is  between  —  and  — ,  participate  in  the  second  stage 

only.   The  rest  participate  in  the  third  stage.   This  procedure  results  in 

partitioning  the  instance  of  the  random  graph  G     into  3  instances  of  a  random 
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graph  G    ..,  in  a  way  guaranteeing  that  edges  used  in  one  stage  are  not  reused 
in  another  stage. 

Stage  1 .   we  apply  the  [Shiloah,  Vishkin,  82]  connectivity  algorithm  for 
41oglog  n  of  its  phases. 

Stage  2.   It  consists  of  < d  loglog  n  phases.   (d   is  a  positive  constant,  its 
exact  value  is  determined  later.)   The  pairs  of  processors  associated  with  each 
edge  and  with  stage  2,  draw  in  the  beginning  of  each  phase  of  stage  2,  with 
probability  —  .   Those  who  succeed,  are  used  only  for  that  phase.   Those  unused, 
try  again  in  the  next  phase.   (This  sampling  of  edges  guarantees  that  an  edge 
is  not  used  in  two  different  phases  of  stage  2  and  hence  the  randomness  of  the 
input  is  not  destroyed.) 

Set  i  =  1. 

Execute  the  following  loop  until   i =  d  loglog  n 


■Phase  i  of  stage  2. 

(a)  Let  £   be  a  positive  constant  greater  than  3,  to  be  determined 

s  (i) 

later.   For  each  supervertex,  construct  a  table  of  size  ^r-; .   Initially, 

^  p  log  n  -' 

all  entries  of  the  table  are  0.   Each  vertex  of  the  supervertex  sets  a  randomly 
chosen  cell  of  the  table  to  1  on  this  phase.   We  declare  a  supervertex  to  be 
large   if  all  its  cells  are  foxind  to  be  1. 

(b)  Let  K   be  a   constant  greater  than  4,  whose  exact  value  is  determined 

later.   Let  the  root  of  each  large  supervertex  choose  to  be  an  aaoeptor   with 

log  n 

probability  7^-^  .   (a   is  a  small  positive  constant  >  1 ,  to  be  determined 

a*s  (1) 

later.)   All  other  supervertices  are  declared  pXOpOiiVUi . 

(c)  Each  proposer  supervertex  chooses  a  random  edge  departing  from  it  to 

an  acceptor  (if  such  an  edge  exists)  and  merges  into  that  acceptor.   This  is  done 
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in  constant  time  on  the  W-RAM  as  follows:   All  the  processors  associated  with 

vertices  of  the  proposer  check  whether  their  neighbors  are  nodes  of  an  acceptor. 

If  one  finds  such  a  neighbor  u,  it  tries  to  hook  its  supervertex  on  D.  -,  (u)  • 

If  more  than  one  try  this,  then  each  one  is  allowed  to  succeed  with  equal 

probability  by  use  of  random  resolution  of  simultaneous  writing. 

(d)   (Collapsing)  :   For  all  vertices   v  we  assign  D.  (v)  =  D.  ,  (D.  ,  (v) ) . 

1      1-1  1-1 

Notice  that  phase   i   takes  a  constant  number  of  steps  on  a  W-RAM.   To  do 
(a)  in  constant  time,  we  must  place  the  assigned  table  in  contiguous  memory.   In 
addition,  we  will  assume  a  distinct  register,  which  is  set  to  1  at  the  beginning 
of  (a) .   The  root  of  each  currently  large  proposer  writes  1  into  a  randomly 
chosen  memory  cell  of  the  root  of  the  acceptor  it  proposes  to.   Then,  each  of 
the  vertices  of  the  acceptor  selects  randomly  one  of  the  cells  of  their  root, 
and  if  any  chosen  cell  has  value  0,  then  it  writes  a  0  in  the  register  of  the 
root  of  the  acceptor.   Otherwise,  they  do  nothing.   The  root  of  the  acceptor 
then  reads  its  register  and  declares  the  acceptor  targe   if  the  register  has 
value  1  and  otherwise  declares  the  acceptor  small.      Finally,  each  currently 
large  proposer  then  resets  its  chosen  memory  cells  to  be  0,  and  the  root  of 
the  acceptor  sets  its  register  to  1. 

To  do  (b)  in  constant  time  on  the  W-RAM,  we  let  each  root  of  a  large 
supervertex  to  make  first  an  independent  probabilistic  choice  and  then  have 
the  processors  associated  with  each  node  of  the  set  read  the  result. 

Stage  3.   We  run  the  (deterministic)  connectivity  algorithm  of  [Shiloah,  Vishkin, 
82]  until  the  number  of  supervertices  does  not  change. 

Finally,  output  the  connected  components.   These  are  given  by  the  super- 
vertices  remaining. 
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3, 1   Proof  of  Correctness  of  Algorithm  MERGE 
We  prove  in  this  section  the  following: 

THEOREM  3.1.  Algorithn   MERGE  computes   the  connected  components  of    g     with 

n  /P 

p>c/n     arnd     c^6,    in  expected  parallel  time     Odoglog  n)      in  the  W-RAM.      The 

2 
expected  number  of  processors  -used  is     0(n  +  pn  ).      The  probability  vhat  the 

parallel   time  exceeds      Odoglog  n)      is      o  (— ) .       (Note   that  with  a  constant 

2  , 

factor  of  time  increase  we  can  reduce  the  niauber  of  processors  to   n + pn  . ) 

Proof  of  Theorem  3. ' •   Let  K  be  the  number  of  supervertices  at  the  end  of 

1  ,   4 
stage  1.   Let  K   be  the  number  os  supervertices  of  size  greater  than  —  log  n 

and  K   be  the  number  of  supervertices  of  size  less  than  or  equal  to  —  log  n. 

Then  K  is  at  most  j—   (since  the  number  of  supervertices  drops  by 

1  ^°5  n    n 

—  on  each  phase) .   Also,  at  least  —  of  the  nodes  will  be  in  supervertices  of 

4 
size  at  least   2  log  n  .   If  the  opposite  was  true,  then 

n   1     4         4 
(1)   K   is  less  than  (—)/[—   log  n)  =  n/log  n 

n    1    4  4 

and   (2)   K_   is  greater  than   ^t' /  (7  l^?  ^i)   i.e.,  greater  than   n/log  n. 

But  then  K  >K.   Thus  we  have  a  contradiction  by  the  Pidgeonhole  principle. 

It  is  easy  to  see  that: 

PROPOSITION  3.1.  At  the  end  of     (a)  of  phase   i  of  stage  2,   each   large  super- 
vertex  has  size  at   least     s(i)/(6  log  n) . 


We  now  will  show 

LEMMA  3.1.  If  a  supervertex  has  size    ^s(i),  then  it  is  declared  large  idth 

-(S-1) 


probability  at  least     1-n 
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Proof.   Let  p(i)   be  the  number  of  cells  of  the  table  of  the  supervertex,  which 

stay  at  value  0  after   (a)   of  stage  2.   We  know  that   s(i)   is  increasing  with 

4 
i   and  s (1)   is   21og  n.   The  probability  that  the  particular  supervertex  fails 

to  be  declared  large  is 

prob{There  is  a  particular  memory  cell  staying  at  value  0} 
=  (l-l/(s(i)/(B  log  n)))"^^'  ^  e"^'  ^°^  " 


so 

prob{p(i)  =  O}  ^  n-n   =  n   ""  o 

-2 

We  now  inductively  assume  that  for   iJJl,  with  probability  >l-n   ,   "At  least 

— r  nodes  are  in  large  supervertices" .   The  basis,   i  =  l,  was  proved  previously. 

2^ 

Let     N.      be  the   number  of   large   supervertices   at   end  of      (a)      of  phase      i. 

Fact    1  .      For   any      n,    p,3        with     n>0,      l^p>0      and     0  <  ;  ^  1 

|(l-S)npJ  ^ 

'(a)  T  (")P^(1-P)  <    exp(-s'-np/2) 

K=0  ^ 

and  (b)  £  (")p^(l-p)""^   <  exp(-6^np/3) 

K=    [(1+6  )np] 

The  above  fact  follows  from  the  boxonds  of  [Chernoff ,  52]  . 
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LEMMA  3.2.  For  any     6'  £  (0,1),  if     N.  ^s(i)-(^^)  then  the  number  of  large 
aoaeptors  at  the  end  of  (b)   is  at   least     L  =  (l-B')N^  •  "•  . . , —  with 
probability     ^  1  -  n 


-2{B')2 


Proof.   By  Fact  1,  applying  it  to  the  Bernoulli  experiment  of  N.   trials  and 
success  probability   r  =  log  n/(a's(i)),  we  get 

-{'-')  N-  (r/2) 
Prob{ number  of  successes  >  L}  > 1  -  e  ^    i 

>  l_3(6")(3/2>log^n^  1  -  n'^/^  (6  ' )  %  i  .  n"'^ '6  ' )  ^    ^ 


By  Lemma  3.2,  and  Proposition  3.1  we  then  get: 

COROLLARY   3.1.     Tke  nionber  of  nodes  in  acceptors  at  the  end  of  phase  i,   step    (b) 

stage  2  is  at   lea^z      (l-B')    N    (— )  log^~  n         with  the  same  probability  as  that 

i  6 

stated  in  Lemma  2. 


Now  we  get  an  estimate  on  N.  based  on  the  inductive  hypothesis:  Since  the 
product  of  number  of  large  supervertices  times  min  size  is  at  least  equal  to 
min  number  of  nodes  in  them  we  get: 


N.  s(i)  , 

1         n  .   -  .     ,,     n  log  n             ,., 

-r ; >  — r-   implying   N.  >  — : (*) 

^1  ^°^^-2^  ^-2^(1)5^ 


Def in  i t  ion  3. 1 •   Let  q  be  the  probability  that  a  random  edge,  not  yet  used 
at  stage  i,  goes  into  an  acceptor,  given  the  inequality  of  Corollary  3.1. 


V 

COROLLARY  3.2.   q  ^  (1-  B')a  ^  "^"^  " 

6  2^s(i) 
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Proof.   We  notice  that  q  =  (number  of  nodes  in  acceptors) /n  and  then  we  can 
apply  Corollary  1.  n 

Now  we  prove  our  central  Lemma. 

LEMMA  3-3.   -fcr  any     a  >4.  Each   large  proposer,      V,    has  at   least  one  departing 
edge  into  a  node  of  an  acceptor,   irLth  probability  at   least     1-n  ,  a  >4, 
given  the  condition  of  Corollary   3.1. 

s  (i ) 

Proof.   Let   s'(i)  =  s(i)/log  n.   Since   P   is  large,   IpI  >  - — ; .   Let 

'  '    B^  log  n 

c'  =  c/3.   Si-.je  edges  are  not  reused  by  the  algorithm,  each  new  edge  tried 

c    c ' 
is  a  random  edge,  with  probability  p'  ^  —  =  —   of  being  there.   By  the 

Chernoff  bounds  on  a  Bernoulli  experiment  of   |p|   trials  and  success  probability 

s'  (i) 

p'-q  ,  there  are  at  least   (l-b)(l-B')  c'  — - —  departing  edges  from  P, 

with  probability  at  least   i _ e'^^^ ■  (1/6) s'  (i)  ^ ^ _ ^-3£2c.  (1/6) ^   ££(0,1). 

Let  \     =  (1-6')  -T-  ,  \     =    (l-£)(l-6')  %-  .   Then,  by  using  Corollary  2,  we  get 

1  Q^  •^  P 


Prob{no  edge  exists  from  P   to  an  acceptor} 
^  .     .  (s (i)/log  n) A2 


<  (1-X^  •(logV2S(i))^2="<^' 

,   K-1   X  s'(i)     /   X  >,^°5''"^"/2ivX2s'(i) 

<  (1_A^  122_^)  2        ^   1-  ^  \   .^.^ 


<  f."  1  2 


e 


^l^.log^"^n/2i 


K-1 
J 

K-2 
<  g-  -1-2-^W   ii    ^  n 


X  \    ■,   K-2 
~  1  2-'-°'3    ^    ^  "'^ 
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where  a  =  (K-2)X  X-,   and  we  can  now  choose   c,   g'   (up  to  row  free)  and 
K  >4  to  make   a  >4.  n 

LEMMA  3.^4.  (Induction  Step).  At  least  .  ^  of  the  nodes  are  (at  the  end  of 
ph^se  i,  stage  2)  in  superoertices  of  size  at  least  s(i  +  l)j  with  probability 
at   least     1  -  n 


Proof.   We  condition  on  the  events  described  in  Lemmas  3.1,  3.2,  3.3.   By  the 

induction  hypothesis  at  the  start  of  phase  i,  at  least  n/2   of  the  nodes, 

-2 

were  in  supervertices  of  size  >s(i),  with  prob  >l-n    and  by  Lemma  3.3 

and  previous  results,  all  these  nodes  end  up  in  acceptors  with  probability 

-4  i+1 

^  1-n   .   Suppose  then  that  despite  the  conditions,  less  than  n/2     nodes 

are  in  supervertices  of  size  ^s  (i  +  1) .   Then,  at  least  n/2     of  these  nodes 

s  (i)  ^ 
must  be  in  supervertices  of  size  <s{i  +  l)   i.e.,<— : — 

2^  (1 +  £)log  n 

Call  the  set  of  these  nodes  V  .   Let  ^.      be  the  number  of  acceptors  at 

lo  \     ^  s(i) 

stage   i.      Then      ^u/".    ^    (1  -  £)  (N.  )    — ttt-        (this   is    so   since,      N.  •— — <    n)  . 

^  1  1  Qsd)  1  b  log  n 

n  lo  ^*^n 
So  ^e/.    <  (1-c) ^  £  .   By  the  Pidgeonhole  principle,  the  size  of  some 

IS  (i) 

supervertex  in  V   must  be  at  least   (— r  ) /^ . 

2^     ^ 

2  ,.  >  2,., 

>      as  (i) ^      as  (i) 

i    K+1  i  4     * 

(i  -  £)2  log   n-B        2  (l  +  £:)logn 


But  this  is  a  contradiction  to  the  definition  of  V  .  c 

f  ft  —  1 ) 
Now  let  us  assume  that  N.  <  s{i)  (this  is  the  opposite  of  the 

1  a 

condition  of  Lemma  3.2)  and  that  there  exists  no  large  set  of  size  greater 

than  T—  .   Let   a'  =  v'a/(6(6-l))  .   By  applying  the  inequality 

log  n 
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N.  >  — : —  we  get  s{i)  ^  a'\n  log  n  2    i.e.   s(i)  >  a'v'n'-log  n.   We 

^   2^s{i)6 

can  choose  a>l  finally  so  that  a'  ^  1.  The  above  result  implies  that  the 
number  of  large  supervertices  gets  very  small  only  after  the  execution  of 
stage  2.   So,  we  conclude  that: 

LEMMA  3.5.  After  the  end  of  stage  2  with  probability    ^1-n    for  any    y  >2 

we  either  have  a  supervertex  which  is  bigger  than    > j—    vertices  in  size, 

log  n 
or  we  have   less  than     v'n  supervertices,   each  of  size  at   least     v'Ti'log  n. 

Proof.   By  Lemmas  3.1,  3.2,  3.3,  3.4  and  the  previous  remark  and  by  taking 
products  of  probabilities  of  lemmas.  D 

Now,  we  prove  our  full  Lemma: 

4 
LEMMA  3.6.  At  most  one  phase  of  stage   Z  leaves     O(log  n)  supervertices  around, 

-3/2 

wzth  probability  at   least     1-n         .      Then,   the  algorithm  of  stage   Z  finds  the 

connected  components  of  the  graph  in     O(loglog  n)      time. 

Proof.      By  Lemma   3.5,    either  we   already  have   a  very   large   supervertex  of  size 

at   least     —       at  the   end  of  stage   2,    or  we  have    <  i-^     supervertices   of 

log    n 
sizes   at   least      yn'log  n     each,    and  the  probability  of  the   disjunction  is 

-a'  c' 

^  1  -  n    ,   Stage  3  examines  random  edges  with  density  —  .   We  now  examine 

n 

2  cases: 

Case  1 .   In  the  case  of  the  existence  of  a  large  supervertex   S  of  size 

n  4 

in  just  one  phase,  we  claim  that  all  but   log  n  of  the  remaining 


T   4   , 
log  n 


vertices  will  join  S  with  high  probability.   To  prove  this,  we  will  use 

Chernof f  bounds  on  the  Bernoulli  experiment  of  M  >  —       trials  and  indepe 

log  n 

c' 
success  probability  at  least  —  .   The  probability  that  a  particular  vertex 
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does  not  join  with  S   in  one  step,  is  at  most 

^^_cljn/log\  ^  ^-c'log^  ^  oln''^'^  . 
n 

4 

The  probability  that  more  than  0 (log  n)  vertices  fail  to  join,  is  then 

4       -4c'       -3c'        -6 
log  n  •  o(n    )  =  o (n    )  =  o (n   ) 

Case   2.      Consider  a  particular  superverticex,      A,    of   size  at   least      vn-log   n. 
Consider  another  supervertex,    say     B,    of   size   at   least      vn-log  n.      Let   us 

assume   that      B      has    at    least   one    edge   out   of   it.      The    probability   that      B's 

v'n-log   n    vn-log   n         ,^       log   n, vn-log   n 
edge  does   not   connect   to     A     is   at  most      (1-    )  =    (1 ) 

^  n  r- 

vn 

<n-^   . 

-2 
Hence,   B  will  connect  to  A  with  probability  at  least   1-n 

Since  we  have  a  total  of  ^vn  of  such  large   B's,  the  probability  that 

-  -2    -3/2 
at  least  one  of  them  will  miss  to  hit  A  is  at  most   >n-n   =  n 

-3/2 
Hence,  with  probability  at  least  1-n    ,  all  of  the  size  of  at  least 

v'n-log  n  supervertices  will  hit  A.   This  completes  case  2. 

4 
In  any  of  these  two  cases,  the  remaining  vertices  are  at  most   log  n  and 

it  will  take   0  (loglog  n)   parallel  time  for  stage  3  to  complete  the  finding 

of  the  connected  components  (with  certainty,  conditioned  on  the  events  of 

case  1  or  2).   This  completes  the  proof  of  Theorem  3,1.  n 


3.2   Improving  the  Number  of  Processors 

We  now  assume  the  P-RAM  model  with  probabilistic  choice  instead  of  the 
W-RAM.   In  the  P-RAM  we  can  run  MERGE  in  0  (log  n-loglog  n)   expected  time, 
by  simulating  each  probabilistic  selection  of  a  processor  in  a  write-conflict 
by  a  tree  of  pairwise  selections. 
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It  is  shown  in  [Chin,  Lam,  Len,  81]  that  the  minimum  of  n  elements 
can  be  computed  on  a  P-RAM  in  time  T  where 

T  =  ""n/k  - 1  +  log  k      for  k  ^  —  ,    k  =  number  of  processors . 

Using  the  above  technique,  we  have  that  the  total  expected  time  is 

loglog  n 


^'^'  -  1  +  log  k)  <  o(^  +  (loglog  n) log  k 


7  J  \   k         -   — ^  --y   -   -^j^ 

t=0 

Selecting  k  =  n/(log  n-loglog  n)   we  have  the  optimal  value   O(log  n-loglog  n) 

2 
of  the  expected  total  time  with  an  expected  number  of  nk  =  n  /log  n- loglog  n) 

processors. 

COROLLARY   3.2.     The  expected  parallel  time  for  conneated  components  of  the  random 

graph     G  with     p  ^c/n     is      Odog  n-loglog  n)      in  the     P-RAM     model  and  the 

2 
expectea  number  of  processors  vs     n  /(log  n'loglog  n) . 

3. 3  Biconnected  Components 

[Ja'Ja',  78]  provided  an  algorithm  for  the  P-RAM  which  finds  the  biconnected 
components  of  a  graph  in  parallel  time   O(log  n-log  d'log  k)   where   d  is  the 
depth  of  the  graph  and  k   is  the  number  of  biconnected  components.   The  n\jmber 
of  processors  needed  in  n  /log  n  .   Again,  this  algorithm  can  be  implemented 
on  the  W-RAM  model  to  run  in   0 (log  d-log  k)  parallel  time  with  n  /log  n 
processors.   The  Appendix  shows  the  expected  depth  is   d  =  O(log  n) .   [Reif, 
Spirakis,  81]  show  the  expected  number  of  biconnected  components  is  k  =  O(log  n)  . 
These  results  imply: 

COROLLARY  3.3A.  The  expected  parallel  time  complexity  for  finding  biconnected 

2 

components  in  the  random  graph     G     with     p^c/n,  c  >5,  is     0(  (loglog  n  ) 

n ,  p 
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2 

fov  the  W-RAM  model  and     O(log  n-(loglog  n)    )      for  the   P-RAM.      The  number  of 

processors  needed  is     n       for  the  w-ram  and     n  /log  n     for  the  p-ram.      The 
probability  that  the  parallel  time  is  more  thar.  this  is  o(— ). 

By   using  Lemma   3.1  we  get 

COROLLARY   3.3B.     The  expected  deterministic  space  for  finding  biconnected 

2 
components  in  the  random  graph     G  with     p^c/n,   c  >5,   is     O(log  n- (loglog  n)    ) 

'^  n,p 
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4.    EXPECTED  PARALLEL  TIME  FOR  ALL  PAIRS  SHORTEST  PATHS  IN  GRAPHS  AND  DIGRAPHS 

Given  an  n  vertex  graph  G  or  digraph  D,  the  all  pairs  shortest  path 
matrix  A  is  an  nxn  matrix  such  that  A(i,j)  is  the  length  of  a  shortest 
path  from  i   to   j   in   G   (or   D) .   Let  A  (i,j)   denote  the  length  of  a 

shortest  path  from   i   to   j   going  through  at  most   k   intermediate  vertices. 

d  0 

Clearly  A(i,j)=A  (i,j)   where   d  =  depth  of  G   (or   D)  .   Let  A  (i,j)   be  1 

if  {i/j;   is  in  G   (if   (i,j)   is  in   D)  and  °=  else.   It  is  easy  to  see  that 

A  (k,j)  =  min  {A    (i,m)  +A    (m,j)} 
m 

d  2   4      2^       r     1     k 

Hence  we  can  compute  A   by  computing  A  ,A  ,...,A   ,   i=  log  k  .   A   can  be 

k  ^2 
computed  from  A  '    using  the  matrix  multiplication  algorithm  with   +   sub- 
stituted from   *   and  min   for   +.   (See  also  [Dekel,  Nassimi,  Sahni,  81].) 

Assuming  binary  representation  of  lengths  and  by  using  bit  addition  with 
parallel  carry,  we  can  perform  all  additions  in  the   k    phase  of  the  algorithm 
in  constant  time  on  the  W-RAM,  by  assigning  one  processor  to  each  pair  of   (i,m), 

(m,j).   The   min   operation  can  obviously  be  done  in  constant  time  in  the  W-RAM. 

2 
To  see  this,  we  have  to  use   0  (n  )   processors  as  follows:   Assign  n-1  pro- 

k/2        k/2 
cessors  to  each  m.   Each  one  of  them  c_."apares  the  computed  v  =A    (i,m)  +A    {m,j) 

with  one  of  the   n-1  other  values.   If  v   is  bigger,  then  the  corresponding 

processor  writes  a  1  in  a  memory  location  C  ,  else  it  does  not  write.   Next, 

m 

one  processor  for  each  m  checks   C  .   If  C   is  not  written,  then  this  processor 

m        m  ^ 

k 
outputs  its   V   into  A  (i,j).   Only  one   C   will  not  be  written  in  this  step, 
m  •'       -'       m  ^ 

Hence: 

THEOREM  4.1.  The  all  pairs  shortest  paths  of  a  graph  or  digraph  aan  be  aomputed 
in     Odog  d)  worst  case  time  on  a   w-ram,  where     d  is  the  depth  of  the  graph 
or  digraph. 
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Note  that  the  above  procedure  is  also  valid  if  we  are  computing  the  all 

pairs  minimum  weight   shortest  paths  provided  that  edge  weights  are  nonnegative 

and  satisfy   the  triangle  inequality.      The  only  modification  needed  is 

A  (i,j)  =  the  weight  of  the  edge   (i,j),  if  it  exists,   =°  else.   This  leads  to 

an  expected  time   Odoglog  n)   for  the  W-RAM  model  for  random  graphs   G     or 

n  /  p 

digraphs      D  since   the   expected     d     is      O(log  n)       (see  Appendix), 

n,p 

COROLLARY  k .\  .      The  expected  time  for  all  pairs  shortest  oaths  in     G     or 

f  •'  ^  '  n,p 

D  models  lyith     p     as  in  the  Theorems   1. 1,    1.2  and  arbitrary  weights,   non- 

n,p 

negative  anxd  satisfying  the  triangle  inequality,    in     Odoglog  n)      for  the 
W-RAM  model  by  using     n       processors.      It  is     O(log  n-loglog  n)      in  the  P-RAM, 
by  using     n  /log  n     processors.      The  probability   that  the   time  is  more   than 


stated  above,    is    <n     ,  y  >  2. 


By  Lemma   3.1  we  get; 


COROLLARY  h .2 .      The  expected  deterministic  sequential  space  for  all  pairs 
shortest  paths  in  the  random  graphs     G     or  random  digraphs     D     (which  are 
arbitrarily  weighted  with  nonnegative  weights  satisfying  the   triangle  inequality) 
is      0  (log  n-loglog  n) . 
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5.    EXPECTED  PARALLEL  TIME  FOR  MINIMUM  WEIGHT  SPANNING  TREE 

5. 1  An  Algorithm  for  Minimum  Spanning  Trees  of  Undirected  Graphs,  in  the  W-RAM 

Let  G  be  a  weighted  undirected  connected  graph,  with  positive  weights. 
The  connectivity  algorithm  of  [Hirschberg,  Chandra,  Sarwate,  79]  can  be  used  to 
obtain  a  minimum  weight  spanning  tree,  by  picking  the  edge  with  minimum  weight 
while  merging  supervertices  with  one  another   (see  [Savage,  78]).   Let  G  be  an 
instance  of  the  random  graph  G    ,  p>c/n,  as  input,  with  arbitrary  nonnegative 
edge  weights.   We  can  use  our  algorithm  MERGE  for  connectivity,  by  using  the 

[Ja'Ja' ,  78]  algorithm  for  the  deterministic  part.   Picking  the  minimal  weight 

3 
edge  will  take  constant  time  in  the  W-RAM  if  we  use   n  -p  processors.   Hence, 

COROLLARY  5.1.  Suppose  we  are  given  an  instance  of  the  random  graph     G  , 
p >  c/n  (c     is  the  constant  defined  in  Z.l) ,   as  input,   with  arbitrary  nonnegative 
edjge  weights.      Then  we  can  compute  a  minimum  weight  spanning  tree  in  the   w-ram  in 
expected  paj^aZlel  time     Odoglog  n) ,    using     n  p  processors  on  the  average. 

COROLLARY  5.2.  The  expected  deterministic  sequential  space  for  a  minimun  weight 
spanning  tree  is     O(log  n-loglog  n),  when  the   input  graph  is  as  in  Corollary   5.1. 

5 .2  An  Alternative  Method  for  Minimum  Spanning  Forests  in  the  W-Ram,  which  Can 
Also  Be  Used  for  Directed  Graphs 

A  parallel  modification  of  the  Kruskal  algorithm  will  be  used.   First,  we 
sort  the  edges  of  the  graph  {or  digraph)  into  a  sequence  of  nondecreasing  weights, 
Then,  separately  for  each  edge,  we  test  whether  the  endpoints  of  the  edge  belong 
to  the  same  component  of  the  graph  (resp.  strong  component  of  the  digraph) 
generated  by  edges  of  less  than  or  equal  weight.   The  edge  will  be  incorporated 
into  the  constructed  forest  iff  the  answer  is  negative  (i.e. ,  if  the  endpoints 
of  the  edge  belong  to  different  components)  (see  [Kucera,  81],  [Reif,  82]), 


■26- 


In  the  case  of  digraphs,  the  above  procedure  will  produce  actually  a  minimum 
weight  directed  acyclic  graph  (DAG)  which  spans  the  digraph. 

The  above  technique  has  a  time  complexity  bounded  by  the  sorting,  since 
the  connectivity  tests  can  be  done  in  Odoglog  n)   expected  time  in  the  W-RAM, 
by  employing  the  [Ja'Ja',  78]  algorithm.   Sorting  takes   Odoglog  n)   time  in 
a  W-RAM  (see  [Borodin,  Hopcroft,  82]  and  [Shiloah,  Vishkin,  80]). 

LEMMA  5.1.   .4  minimum  spanning   DAG  (directed  aayclic  graph.)   of  a  digraph 
can  he  found  in     Odoglog  n)  ez::peoted  parallel   time  in  the   W-RAM. 
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6.    EXPECTED  PARALLEL  TIME  FOR  RANDOM  GRAPH  ISOMORPHISM 

The  following  algorithm,  given  by  [Babai ,  Kucera,  79]  is  essentially  a 

Breadth-First-Search  procedure  applied  to  random  graphs  of  the  model  G     with 

n,p 

P=  1/2. 

[1]   Classify  vertex  by  valences  so  each  vertex  is  labelled  by  its  valence. 
Let  C  ,  ...,c   be  the  classes  of  the  above  classification,  arranged  by  the 
order  induced  by  this  classification. 

[2]   In  the  first  refinement  we  let  N. (v)   be  the  number  of  neighbors  of  v 

4 
in  c.   and  let  N. (v)  =  smallest  nonnegative  integer  congruent  to  N,  (v)mod  4. 

Then,  two  vertices  v,u  are  ordered  now  if   (a)  they  were  of  different  label 

before  of   (b)  they  had  same  labels  but 

(Nf(v) N^(v))  <  (Nf(u), N^(u)) 

i         n         1         h 

lexicographically. 

[3]   Let  c  ,c  ,...,c  ,  be  the  equivalence  classes  of  [2].   Apply  step  [2] 
to  each  of  these  classes. 

THEOREM  6.1.   [Babai,  Kucera,  79].  Let     u  be   the  set  of  vevtiaes  whose  class 
is  not  a  singleton  after  step    [3].  Then 

Prob{|u|  >  2} <  exp(-2cn) 

for  some  absolute  constant     c >  0. 

The  valence  classification  can  be  done  in   O(loglog  n)   time  on  the  W-RAM 

2 
model  by  using  0 (n  )   processors.   In  the  P-RAM,  it  can  be  done  in  time   0{log  n) 

2 
by  using   n  /log  n  processors.   The  partial  orders  created  by  the  refinements 

are  best  stored  as  a  function  corresponding  to  them.   These  function  computations 

cost  not  more  than  Odoglog  n)   time  in  W-RAM  and  O(log  n)  in  the  P-RAM,  by 
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2  2 

using     n        and     n  /log   n     processors,    respectively.      So: 

COROLLARY  6,2.     4  oanoniaal  labelling  algorithm  on     G        ,   p =  1/2,    takes  expected 

n  /P 
2 
parallel   time     Odoglog  n)      in  the  W-RAM  model  jjit/z     n       pvocessovs  and     O(log  n) 

.  ,    2 

time  in  the   P-RAM  m.t/2  n  /log  n  pvooessovs. 

By  Lemma  3.1,  we  get 

COROLLARY  6.3.  The  expected  sequential  deterministic  space  needed,   for  a 

canonical   labelling  algorithm  on     G   ,  p =  1/2,  is     O(log  n-loglog  n) . 
*^        ^  n ,  p 
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APPENDIX 


THE  DEPTH  OF  A  RANDOM  GRAPH   G      AND  OF  A  RANDOM  DIGRAPH   D 

n,p  n,p 


We  prove  here  Theorems  1.1  and  1.2. 

LEMMA  A.O.  Theorem   I. I   imrlies  Theoven   1.2 


Proof.   Let   D   be  anv  instance  of   D 


n  ,c 


The  decth  of   D   cannot  e.xceed 


the  depth  of  the  underlying  undirected  version   G   of   D   (which  is  produced 


from   D  by  removing  all  edge  directions) .   However,  G   is  an  instance  of 
,   _     2 


n,c 


with 


Theorem    1.  1       holds    for      G 


n,p' 


for     p '      m   the   range 


[0,1]    - 


_0        0    _   /_0  j 

n     '      n  \  n  / 


where      c„      is    a    fixed    constant    (to   be    determined   in    the   oroof   of    '^heorem   1.1).      Hence, 
0 

2 

Theorem  1.  2   holds  for   D     for  p   determined  by   p '  = 2p  -  p  £  R.        D 

n  /  ^ 


A  1.   Proof  of  Theorem  1, 


for  p  =  9  (-) 
n 


We  prove  here  Theorem  1. 1   for  undirected  graphs.   This  provides  an 

upper  bound  on  both  the  average  depth  and  the  depth  distribution  of   G     for 

Cq    2c^    /c.,2-' 


all  p  =  6 (— )   except  those   p   in 


!o)' 

n  / 


where   c„   is  the 


cutoff  constant  for  paths  of  length   dlog  n,  d>2.   This  provides  a  partial 


answer  to  the  problem  of  finding  the  depth  of  G     for  p  =  9 (— )  ,  open  up  to 

n  ,p  n 

now.   The  remaining  cases  of  p  =  o(— )   and  p  =  ."i;  (— )   can  be  derived  bv  results 

^  n  n 

of    [Erdos ,    Renyi ,    59]     (see    A. 2    and   A. 3    of    the   Appendix). 

Consider   the    following   two-staqe   experiment:      Draw   an    instance    of      G 

c ' 

with  p  =  —   (c'  >  0) .      Then,  for  each  edge  which  failed  to  appear,  draw 

c' 
again  with  probability  p  =  — .   The  result  of  the  combined  experiments  is 

2c'     c'  2      c' 

an  instance  of  the  graph   G     with  p=p,  +  (1-p  )p^  =  -  ( — )   =  9( — ). 

n,p  1  i2n  n  n 

The    first   stage   of  the   above   experiment    constructs    an   instance      I      of     G 
Let     <?     be   the  event    "the   instance      I      has    a  path   of    length     6  • log  n      (where 


A. 2 

5  is  a  fixed  constant)"  and  &       be  the  complement  of   I.   Let  us  consider 
the  probability  of  €.      We  shall  need  the  following  Lemma: 

LEMMA   Al.l.  (  [Erdos ,  Renyi ,  59]).  Given     G     'uiit'n     p  =  -,  let  T^  , 

n  ,  p  n  0  •  1  og  n 

:s  c'    Zenath    5  -loan      tK     G        .      ZheK 

n  ,p 

-X 
liir.      Frobl.T„    ,  =0  ►    =    e 

0 • log   n 
n-^ 


,  ,-    ,0-logn-l  (5  .  log  n) 

A   =    (2c)  ^  T^Tj — 

6  •  log   n)  ! 


The    above    lemma   can   be    used   to   Drove    that      "3c^>0:       for      c'^c^      the 

0  0 

graph   G     with  p'  =  ~  has  at  least  one  path  of  length  5  •  log  n   with 
n  ,p  n 

c 
probability  -^1   as   n -"=   and  for  c  <  c^      the   G     with  p  =  —  has  no 
^         ^  0        n,p  n 

such  path  with  probability  -^1   as   n-^«"  (to  prove  that,  find  the   c    such 

that  \{c)-*^     if   c>c   and   X(c)-*0   if   c<c   for  n-*°^).   Assume  in  our 

-Ci '  (c '  ) 
construction  that  c'  '^  c    .      Then,  Prob(<f)^l-n        ,  where   a'(c')>2   is 

a  constant  depending  on   c'.   Let  us  condition  the  following  on  the  event  ^. 

We  start  from  the  vertices  of  the  path  T   of  length   6 • log  n   (at  least 

such  a  path  exists,  by  S)       and  use  them  as  the  initial  border  set   B.   for  a 

breath-first-search  (BFS)  process  in  the  graph  induced  on  the  vertices  of 

G     by  just  the  second  stage  of  the  experiment. 
n,p  ^  c 

^^t   |Bq1  = 0 . log  n   and  let  us  find  the  distribution  of  the  number  of 
vertices  to  which  the  vertices  of  T   are  immediate  neighbors.   The  set  of 
these  vertices  will  be  the  new  border  set  of  the  second  stage  of  BFS,  and 
let  us  call  it  B  .   We  are  interested  in  the  Prob{ ; B  j  >2-5.1og  n}.   Let 
|Sq|   be  equal  to   |b  |  = 6 • log  n   (the  number  of  visited  vertices  at  the 
beginning  of  the  first  stage  of  BFS) .   The  unvisited  vertices  are 
"  -  I  Sq  I  =  n  -  5  •  log  n   and  each  of  them  has  probabi  lity   h  =  1  -  ( 1-p  )  ^  '  ^°'^  " 
to  be  connected  to  B  .   Hence 


A. 3 


(0  I,  X  , ,  ,  ,  n-5  loG  r.-x 


But   h   =  1  -  (l  -  —  r'  °^ '^   and  since  ^^—^ '  0   as   n -^  ^  we  have  that 

0  \  n  /  n 


Y^     -.  £ — z — °2 —     as      n-*^'.      The   mean   number    of       |B    }       is 
On  1 

2        2 
(n-|S    |)h^    =    c'f  log  n    -   ^-^= = -^    c';   log  n  as        n  -  =^ 


Let   c'  >  2      and  _et  us  find  the  probability   Frob^iB  !  < 25 • log n =2 j B  i } 


From  [Feller,  57]  eg.  14C  this  is  less  than 


whe  re 


0   \,  2-  log  n  ,,  ,  ,n-t  loa  n-26  log  n 
2-  loQ  n' 


(n-^  lcgn-25  logn+l)h_        c'5  log  n  , 

(n-  6  log  n+1)  h  -2:  log  n      c'j  log  n -2  c  log  n    c'-2 

as   n  -* 


Then   Q   is  asym.Ftotically  equal  to 

'  n-  Slog  n\  /  ,       \2  6  log  n  /      ,  _.      \  n-3  ;  log  n 


r-   ^^°^"\  Ic   Slogny'^  '°^"  I  c--:ioqn  V 

\  2   610Q  n  /  \    "    y'        \        ^    / 


c 


.        ..     ,  c'-2 

2  clog  n 


26log(c'/2)  -c' •'        c'   ^   -f  (  c  ' -21oa  (c  ' /2)  ) 
=   n  /n     •  — ^ — r  ^  n 

c  -2 


=   n 


for        e  =  6(c'-21og(c'/2) ) 


Hence,   Prob{ j B  1  > 2 •  | B  j }  ^ 1  -  n  ".  We  now  inductively  form  sets  S    and 

1  U  K 

B,   such  that   S   =  B.  +  B  +  •  •  •  +  B,  and  h,  =  1  -  (1-p^)       and  B,  , 

k  k    u    1        k        k  2  k+1 


being  selected  according  to  the  density 


/  n-S  \         n-S  -X 
Prob{B^^^=x}  =    /)h^^l-h^) 


(This   models    the   EPS  process) 


A. 4 


I  _    I  ^  n 
We    claim  tnat ,    as   soon   as      ib    I  ^—   , 


Prob{  IB,  I  ^2-  IB       ,1  giVc.      F;  >  1  -  n    " 

1  1-1 

for 

1=1 

r  =    A       J  B  I  ^  2  ;  3       I ) 
.=1  ^  :-i 

This    is    so,    since    the    first    extension    (fror    tne    srr.allest    border    set      B_  =  ;  "  Ic 

IS    tne    niost    diffirulc.      Then,  tne    event 

k-1 

<f        =  A       (|BI>2IE  I) 

1  .  1  1-1 

i=i 

has    crobatility      ^     F!       ( 1-n      )  ^1  -n  ar.d    let      k      be    tne    last   index    for 

1=1  is    I 

which   Is  I  <T-    5et   Is,  I  =  -  ,1-c)   witn   I B,  I  ^  ^rr—     =   ^{l-e)  .      From 
k    2  k     w  K     ^      t 

Is  I  =  iBj  +  ■  •  •  +  IB,  I  ^  l3.  I  *  2lB,  i  +  •  •  ■  +  :''^IE,  i   we  get   I  S  I  ^  ( 2*^*  •"■- 1 )  I  B„  I 
k  J  k      o'       u  J  k  i-i 

imc lying 

locl^^  +  1  I  -1 


loq 


\2   5  log  n     / 


Kence 


l-£ 
=   log  n  +  log  -r^  -  loglog  n  -  1 
2  d 


Su  1  =  S  +B,    >S  +  2B,  ^^  (1-e)  -r  ^  (l-£) 
k+1    k   k+1   k    k   2         2 


>   n(l-£) 
So,  we  proved  that: 

LEMMA  A  1.2.  Conditioned  on  the  event    <?,  the  BFS  process  ijill  visit  at 


least     n(l-e)  nodes  of    G  in     'v        stages  where     i,  <  log  n,  L'it^ 

n,p^  1  -  1 

-S+1 
pvobabiliti)     ^1  -  n 


n,P2 


A. 5 


Let   us    allow   the   BFS   to   continue    until   it   can   visit  no  more   vertices,    and 
let      f      be    the    final   stage    (such   that       :-f+i'  = '^)  ■       Consider    the    stage      f  <f 
such    that       1  S       i  ^  n  -  log   n      but   either       i^f +  i  ^   >  n  -  log   n      cr       ,B;=0. 
After      f       either    the    process    stops    cr    there    are    at    nost      : • log   n      nodes    more 
to  visit    and   this    cannct    take   mere    than      --log    n      stages    m   either    case.       At 
f,    the    set      5^,       of    "-unvisited"    nodes    is    at    least    of    size      :  •  log   n.       Consider 
the    cackward   process    m   vr.icn    tr.e    set    of    unvisited   nodes    increases    as    stages 
are    executed    m    reverse.      The   new   process    is    symjietric    to   the    BFS    and   because 
of   the    symr^etr-/   we    can    argue    that: 

lE.'^-'lr.     Al.3-      V-Jr^.  rrc^zzz  liz\.    ^r    lezsz      1-n   ~*  ^ ,    z'r.e  rrccess   cf  cack- 
scrd  f.~i"  is'-ll  rezc'r.  z  sez   cf  zz   l^zsz     n(l-£-)      Kcdes   ir.  r/^-.ter  :f  stages 
T, ^ log   n. 

By    Lemm.as    Al.l    ar.d    A1.3,    tnere    is    a   sta^e    n'urxer      :'    sjch   ~'.a-    f'--     <-'  <-     < 

-    -       .^  _  .     _  .^  _ 

in   which    tne   border   sets    of    the    two   processes    either    coincide    or   one    is    a 

subset   cf   the    other.      Hence,    T'  +T        is    an    upper  bound   to  numier   of   stages 

up    to   stage      f      and   the    total   number    of    stages    up    to    the    end   cf    the    (forward) 

BFS    is      '.'  ■*■',-*■    -log  n  <  (2  +  c  )  log   n.      The   probability    of   both    the    events 

considered    m    Lemjr.as    Al .  2    and   A1.3    together    is    at    least      1    -    Zn"""*"    .       Hence,    we 

have   proved   that 

LEMMA  A1.4.      The  diameter  cf  the    larcer  ::crrrcKenz    ( zr.a  hence   the  depth 

of)      G  is     ^(2 +  6) -log  n     i.'ith    (conditional   en     Sj    vrobcJoilit'^   at    least 

n  /  p  ..  *^ 

1  -  2n 

Dropping  condition    <f,  we  get  that: 

2c'  c '  2 

The  depth  of  G     with  p  > ( — )    is  < (2  +  6)  log  n  with 

n  ,p  n  n 

probability   at    least      1  -  max(2n"'^'^    .n"*^'*^      ),    where      6   =    5  (c' -21og  (c' /2)  )  . 

By  selecting      6      and     c'      suitably,    we    can   guarantee   a  probability   of 

-Y 
at    least      1-n      ,    with      Y  >  2. 


A. 6 


Note  that  for  p^c'/n,  Lemma  A  1.4  implies   Prob{depth  of  G  ^^5-loqn;-*  1 

c'       2c'  c'    '' 

as      n-'°^.      Kence,    our    result    holds    for    anv      p,      c€  [ — ,    -    ( — )~],      where      c' 

n    n      n 

IS  the  cutoff  constant  for  paths  of  length   ""•log  n,  i  >2.      This  proves 

Theorem  1.1    with   c-  =  c'   for  the  case   p  =  6 (— )  .  a 

0  n 

A2 .       The    Case    cf      r  =  o(n      ) 


LEMMA  A  .2       (  [Eraos  ,    Renyi ,    59]).      lev   v'r.e   cverzpc   Vc:>^>:gg   cj   a  crcZ'''-   C' 

n     vevzices   cs'.d    m      -adc-es  Jt    z'r.e  Kur.ter     2n,'m. .      ler  a  ^^ccc'c  be   aallea  balcrr.aec 

if  iz3  crjerc^e   valence   is   ^reazer  z'nar.,    cr  equal  zo  the   average   valence  cf 

an'^   c"  its  3ucc,var'ns.      I'nen,    zhe  rvctabilit-^    that     G  ccnzains  at    least 

-      ■  -      ^  ■  -  n,f 

one  r:er.cer  c'   zhe   class     3,     .      :'  calar.ced  sra::hs   c'     k     vertices  and     i 

k,K       ■  -      - 

eases   Is 


Prob{,B^^^,^l;   =  0^-^J 

COROLLARY  A. .  2 .    Prob{depth  of  G    <  log  n;-^l  as      n  ^  =°  fcr      p  =  o(n~'''). 

Proof.   For  p  =o(n   )   we  can  always  write   p   as  p  =  (c/ (n*a)  (n)  )  )  where 

c>0   IS  a  constant  and  L-in)  -^"^      as  n^°^.      Let   B  =  B  be  the 

log  n,log  n-1 

class  of  paths  of   log  n   vertices.   From  LemjHa  A. 4  we  get 

Prob{G    contains  at  least  one  member  of  b} 

n,p 

=  0  I ~ )  =   0  (  — ^  )  -►  0    as   n  •*  °°       D 

\n.c.(n)-n-^°5  "/^°5  ""V       l-'n)  / 

The  above  implies  that  the  average  depth  of   G     is  less  than   log  n   for 

n,p 

p  =  o(n   )   and  all  t^  ^  ^   for  some   n  . 


A. 7 


A  .  3      The    Case    of      p  =  T.  (n      ) 


LE.MMAA.3.       (  [Erdcs ,    P.er.yi ,    59]).      Let     f=   .z   r.)-^ir.>      v-;-.--;      -  >   0      ''.-■   -: 

acnstar.t  ar.d     -.(n)  -"^     as     n-*^.      Zhen,      if     ,8,  is    the  r.uir.ber  cf  balcs^.ced 

^ycz'hs   c'   the   class     B  Tk     vertices,      ■-     edres,    ccrtainea  in     G  Je 


zcTJe 


?rob    ,  B,     -  ,'     -^ir.) 


'L(r.)/ 


Probi  ,3,      .  I    =    0 ;    =    0 


k,'-  V-.;n 


COROLLARY   A.  3 .      .he  derzh  c"     G  is     <  ioa  n      'cr     p  =  .'.  (n      ),   -jizh 

n,p 

^orc'catiliz'^i   .rsina   tc    1  as     n-*=°.  Zkis   i"T  lies   zhat ,   f:v     p  =  r.  (n   ),  the 

TJercce  derth  c^     G     is  net  r^.cve   than     Icq  n  ':r  all     n  >  n  ,  ^or  sorre      n.  >  0. 
'    -     •    n ,  c  u  u 

Proof.   Let   B   ,   be  the  class  of  Einarv  balanced  trees  of   n   vertices, 
n-1   edges.   Then 

n-1 


Prob{  Ib,  J  ^  1}  >  1  -  C 
k  ,jc 


\\u.(n)  / 


Hence 


Prob{|B,  nl^l}-*!     as    n -" 
k,  X. ' 


-■  ^3 
A. 4  The  Case  of  High  Densities,   p  >  en  '   ,   c  >  1 

LEMMA  A.  4.  The  depth  cf    G       ,    ^ov     p>  zn  is    <3,   vizh  ::  rot  ability 

tending  to  \  as     n  goes   to     °°.  Here     c  >  1  is  a  constant. 

Proof.   Consider  any  pair   u,v  of  vertices  of  G    .   Let   S 


n,p 


u 


(respectively   S  )   be  the  sets  of  neighbors  of   u   (respectively  of   v) 
Then,  for  anv   6  €  (0,1) 


A. 8 


-2 


Problls  •  ^p(n-l)  (1-6)  }  >1  -  e 
u 


(n-l)/2 


by  the  Chernoff  bounds  [Chernoff,  52].   For   d = 1/2  we  get 

2/3 


Sir.ilarly , 


Hence 


Frob 


Free 


Free 


S    ^ 

u 


s   ^ 

V 


-; ,  ?  1  -  t  "^ 


^    1  -  e-<=/^-"' 


:/3 


;cth   S  ,  ,  I 


=:    ^ 


en 


^    1 


,   .  -(:/S)r. 

1  -  ^e  ^ 


2/3 


Let   <f  be  the  event  "both    3  ^  :S    ^ 


en 


Then 


I  ^    1     ,  ^ 

Prob'no   edce    between   S    ,    5      given     c  ;    %:    (1-p)  ^  [1 

u    V 


c    4/3 
—  n 
c   \  4 


-  e-P  I" 


3  - 
fn 


Hence,  combining  terms, 


?robi3  path  of  length   ^3   between   u,v} 


(:-2e-^-4"^'=)(,..4M-).1.3e-^4n^'' 


^1-2 


Hence,   ProblV  pairs   u,v   3  path  of  length  ^  3  between  them} 


>  ,    ,2   4:/8)n2/^    ^ 

^  1  -  3n   e  r^  /      -'1    as   n  ^ ' 


/ 
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